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We present a new stochastic approach to describe and remodel the conversion process of a wind 
farm at a sampling frequency of IHz. The method is trained on data measured on one onshore 
wind farm for an equivalent time period of 55 days. Three global variables are defined for the 
wind farm: the 1-Hz wind speed u{t) and ten-minute average direction (f> both averaged over all 
wind turbines, as well as the cumulative 1-Hz power output P{t). When conditioning on various 
wind direction sectors, the dynamics of the conversion process u{t) —>■ P{t) appear as a fluctuating 
trajectory around an average lEC-like power curve, see section [II] Our approach is to consider the 
wind farm as a dynamical system that can be described as a stochastic drift/diffusion model, where a 
drift coefficient describes the attraction towards the power curve and a diffusion coefficient quantifies 
additional turbulent fluctuations. These stochastic coefficients are inserted into a Langevin equation 
that, once properly adapted to our particular system, models a synthetic signal of power output 
for any given wind speed/direction signals, see section jTTTj When combined with a pre-model for 
turbulent wind fluctuations, the stochastic approach models the power output of the wind farm at 
a sampling frequency of IHz using only ten-minute average values of wind speed and directions. 

The stochastic signals generated are compared to the measured signal, and show a good statistical 
agreement, including a proper reproduction of the intermittent, gusty features measured. In parallel, 
a second application for performance monitoring is introduced in section HYI The drift coefficient 
can be used as a sensitive measure of the global wind farm performance. When monitoring the wind 
farm as a whole, the drift coefficient registers some significant deviation from normal operation 
if one of twelve wind turbines is shut down during less than 4% of the time. Also, intermittent 
anomalies can be detected more rapidly than when using ten-minute averaging methods. Finally, 
a probabilistic description of the conversion process is proposed and modeled in appendix]^ that 
can in turn be used to further improve the estimation of the stochastic coefficients. 
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I. INTRODUCTION 

Wind energy is currently the fastest growing energy sector in Europe [1]. The fast integration of the complex 
wind resource in electric networks raises new challenges in terms of grid stability Hfij. Smart grid concepts are 
being developed to cope with the turbulent nature of wind power, that involve e.g. energy storage or smart control 
strategies. The challenge is on the one hand technical, as the current grid structure must evolve from a strongly 
polarized source/load configuration with few large power sources towards a delocalized distribution of small renewable 
sources [S]. New grid concepts must be designed to handle the increasing amount of fluctuating wind power, see e.g. 
Ref. [B] for an example on active wind power regulation. On the other hand, a fundamental understanding of the wind 
resource still lacks. Wind energy planing still focuses mostly on large-scale meteorological wind dynamics. Dynamics 
at faster time scales in minutes are typically simplified to a Gaussian wind field [7], bypassing the complex dynamics 
of turbulence [HHin]. Yet it is observed on measured data that fast wind fluctuations deviate from Gaussianity nni, 
and are instead intermittent multifractals 0111]. This observation must be coupled with the fact that modern wind 
turbine designs optimize power performance |12j by following these fast, intermittent wind fluctuations. Besides the 
obvious impact on the mechanical loads acting on the turbine machinery [num, such performance-oriented control 
strategy implies feeding intermittent gusts into the power grid [3] . Our increasing dependence on wind energy stresses 
the necessity to reliably predict first the wind dynamics, and second the resulting wind power production. 

The first aspect is a fundamental challenge that traditionally involves both fields of meteorology for large-scale 
dynamics and turbulence for fast fluctuations. The central problem is that wind dynamics involve a wide range of 
spatio-temporal scales that are entangled in an energy cascade. The standard picture involves a three-dimensional 
downward cascade at small scales and a two-dimensional inverse cascade at mesoscales. In between, a hypothetical 
spectral gap |15j would separate the two regimes. Some recent studies contradict this hypothesis and propose a uni¬ 
versal cascade model instead [min]. Besides solving directly Navier-Stokes equation including all external influences 
such as thermal transfers or topography (that is far from achievable), no simpler alternative is recognized as a valid 
method to describe the entire wind dynamics. Instead various meteorological models parametrize all the influencing 
effects, but they are limited in their resolution due to their high computational cost, nowadays resolving spatial and 
temporal scales of typically 1 km^ and 1 hour. The smaller structures (that remain quite large) are usually not 
modeled using GFD, but with a statistical model of turbulence, e.g. a Gaussian correlated field [7|. It should be noted 
that high-frequency wind models have been developed for decades m. and the turbulence community has proposed 
several approaches in recent years [T9H2I]- 

The second aspect consists in modeling the conversion process operated by the wind installation, seen here as the 
conversion of a wind speed u conditioned on the wind direction (j) into an electrical power output P. The complete 
conversion process is commonly simplified to an average power curve [22] in the case of a single wind turbine. Being 
an average curve, this approach is a good description of the long-term behavior, but naturally fails to describe the 
dynamics in the faster time scales of minutes 0 Ref. [23| recently proposed a prediction model for the low-frequency 
(30-minute) trend of a wind farm power output from weather data using a mixed statistical/GFD approach. It has 
been recently shown that the cumulative power output of a wind farm 0 or a 300-km large wind installation |25j 
contain intermittent fluctuations at time scales of minutes and even seconds that are typical of a multifractal process 
1^ The standard power curve method overlooks such effects, so it is impossible to predict precisely how stable the 
power grid will be at the short time scales at which it is operated. 

For the optimal exploitation of wind energy, it is also important to minimize downtimes. Servicing and spare parts 
were found to account for one fourth of operation and maintenance costs for onshore installations, i.e. about 1% of the 
total investment EZj. Ref. [55] made a 15-year-long study and showed that the 1500 onshore wind turbines studied 
had an average availability of 98% (downtimes occurred during 2% of the time). About half of the recorded failures 
were attributed to electrical and control systems, the other half coming from mechanical systems. It is interesting to 
note that failure rates were especially high for wind turbines of IMW and more. Recent results [29] show that the 
availability of offshore installations is much lower, in the order of 70 — 90%. This makes the development of procedures 
for early warning, identihcation and reparation of damages essential, see Ref. m for an overview of various detection 
methods. In particular, methods for the early prediction of emerging failures are very valuable in order to react 
optimally. There are strong indications that the turbulent nature of wind flows affects the mechanical fatigue of wind 
turbines M- This underlines the necessity to include the fast dynamics of turbulence in wind energy methods in 
order to further improve the reliability of wind energy systems. 


More accurate approaches involve an aero-mechanical description of each wind turbine m> which becomes excessively demanding for 
everyday prediction. 

^ This observation contradicts the intuitive yet false argument claiming that the fluctuations of neighboring wind turbines cancel out. 
Considering many wind installations separated by a distance shorter than the correlation length of atmospheric winds (in hundreds of 
kilometers according to Ref. dsi), these neighboring installations are driven by correlated wind fields and produce correlated outputs. 
They are not independent power sources, and their cumulative output does not sum up to a Gaussian process, that is the central limit 
theorem cannot be applied. 
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In this paper, we propose an alternative approach that describes and remodels the conversion process of a wind 
farm at a frequency of 1 Hz. Our analysis is performed on a wind farm that is described in section [llj We present a 
stochastic model for the conversion process in section [Till where the dynamics are intuitively characterized through a 
set of stochastic estimates. We show in section [IV] that these estimates are highly reactive to dynamical changes, and 
we promote them as well for condition monitoring of the power performance for a wind farm. Also, a probabilistic 
description of the power production is presented in appendix [A] 


II. WIND FARM DYNAMICS 
A. Data description 


All results presented in this paper were extracted from measurement data for one wind farm. The wind farm is 
installed onshore over an area covering roughly Akm?, and is surrounded by flat rural terrain. It consists of 12 identical 
variable-speed, pitch-regulated wind turbines. The rated power of each turbine is in the order of 2MW. Its exact 
value cannot be published following an agreement with the farm manager. For this reason, all values of power output 
will be normalized to the rated power of the entire wind farm = 100% ~ 12 x 2MW. 

Measurements were conducted synchronously at each wind turbine. The measured signals are the net electrical 
power output generated by each wind turbine, and the wind speed and direction measured on each nacelle by a cup 
anemometer and a wind vane. The operational status of each wind turbine was also provided, so that the data was 
rejected when one or more turbines were not operating in normal conditions. 

All measurements were performed at a sampling frequency fs = IHz. Unless stated otherwise, all the time series 
presented in this paper have a sampling frequency of IHz. The measurement campaign was conducted over a period 
of eight months, from June 2009 till February 2010. The measurements were regularly interrupted, eventually leaving 
7, 775 ten-minute periods (4, 665, 000 samples, or 53 days 23 hours and 50 minutes). 


B. Wind farm observables 


This paper aims towards wind farm dynamics, and not dynamics of single wind turbines. In order to describe the 
entire wind farm, some new observables were defined from the data measured on all single wind turbines. Ref. m 
observes that the average of all single wind speed and direction measurements is a more precise measure for power 
curve modeling than an eventual met mast measurement. Following this observation, we define three observables for 
the wind farm. The average wind speed over the entire wind farm is defined as 

1=1 

IHz by the cup anemometer on the nacelle of turbine i. Similarly, the 
farm is defined following 

1 ^ 

= ( 2 ) 

at IHz by the wind vane on the nacelle of turbine i. Finally, the total 
electric grid is 

N 

(3) 

i=l 

where Pi{t) is the net electrical power output of turbine i at IHz. N represents the total number of turbine mea¬ 
surements considered, in our case TV = 12 corresponding to all the turbines in the wind farm. The three signals 
{u{t), P{t)} are sampled at IHz. From now on, these three effective observables for the wind farm are named 

the wind speed u{t), the wind direction and the power output P{t). 


where Ui{t) is the wind speed measured at 
average wind direction over the entire wind 


where is the wind direction measured 
power output fed by the wind farm into the 
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C. Ten-minute dynamics 


While our analysis focuses on 1-Hz dynamics, we first analyze the wind farm dynamics on the basis of ten-minute 
averaged results. The ten-minute analysis gives a coarse picture of the wind farm dynamics. Within each ten-minute 
time period, the average wind speed u is calculated from the 600 samples u{t) following 


u = 


1 

600 


600 

■ 




(4) 


Similarly, the ten-minute averare wind direction (j) and power output P are calculated from and P{t). Histograms 
are presented in figures and^ for u and P. From the 7, 775 ten-minute periods measured, we can observe the 
Weibull-like shape of the wind speed histogram, as usually observed for atmospheric wind measurements. This is an 
indication that averaging over all turbine anemometers following equation Q conserves the wind speed histogram. 
The histogram of power output shows that power values below 20% are most probable, and that the probability of 
measuring a higher power value decays rapidly. However, it is interesting to note that power values up to 105% of 
the farm rated power are measured, in which case all turbines deliver slightly above their rated power. 



wind speed u [m/s] 



0 20 40 60_ 80 100 

power output P [%] 


FIG. 1. Histogram of ten-minute average wind speed u with 
a resolution An = 0.5 m/s. 


FIG. 2. Histogram of ten-minute average power output P 
with a resolution AP = 5 %. 


A histogram of wind directions ^ is presented in figure]^ for 12 wind sectors of size A0 = 30°. The most frequent 
wind directions lie within the South-West sector, as expected from the wind farm location. From now on, the data 
will be systematically separated for the 12 wind sectors k such that (fc — 1) • 30° < (j) < k ■ 30°. 

We define the lEC-like power curve for the wind farm by adapting the norm lEC 61400-12-1 originally designed 
for a single wind turbine H For each wind sector k of size A0 = 30°, we sort the ten-minute averages of wind speed 
u and power output P. We then average the ten-minute averages in each wind speed biiW of size Au = 0.5 m/s. 
This yields an lEC power curve PiEc{uj,^k) for each wind sector, as presented in figure H The power curves look 
similar to the typical lEC power curve of a wind turbine |23j . A cut-in wind speed is foundat around 4 m/s, and the 
rated farm power is reached at roughly 13 m/s, as specified for the single wind turbines in the farm. We observe some 
deviations in power performance of up to 10% depending on the wind direction. This is due to the various wake effects 
(reduced wind speed and power downstream of wind turbines) that change with the inflow direction. This implies 
that the global performance of the farm depends on the inflow direction, and modeling or monitoring applications 
must be direction-dependent. 


From now on, we refer to the lEC-like power curve as lEC power curve. We stress that it is not the original procedure, but our own 
adaptation of the lEC standard to a wind farm. 
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FIG. 3. Histogram of ten-minute average wind direction (j> 
with a resolution A(j> = 30°. The four cardinal directions are 
indicated for reference. 


FIG. 4. lEC-like power curve PiEc{uj,<j>k) with correspond¬ 
ing error bars for the wind sectors fe G {1,12}. 


D. IHz dynamics 

We focus in this paper on wind farm dynamics at IHz. This focus is motivated by observations of measurement 
data, as presented in figure The effective wind speed u{t) and power output P{t) display fast fluctuations at IHz. 
While the ten-minute averages u and P give a rough estimate, the actual dynamics of the wind farm happen on a 
faster time scale. The wind farm filters the fastest wind fluctuations in time scales of few seconds due to its inertia. 
However, wind speed changes happening within seconds and minutes are converted into power output changes. The 
high frequency data also shows wind gusts that are underestimated by ten-minute averages. It is the case in figure 
[^e.g. at time 100 min < t < 110 min, where the wind speed u(t) drops by 5m/s (while u only drops by 2m/s). In 
the same time, the power output P{t) drops by 60% (while P only drops by 30%). This indicates first that rapid and 
large changes are measured for the wind speed and power output, and second that ten-minutes averages obviously 
underestimate the amplitude of these rapid changes. 

The {u, P} space is common for power curve methods because it is the phase space representing the input/output 
variables u/P for a wind turbine. We use this representation for the wind farm in figure The fast IHz data 
fluctuates around the lEC power curve with deviations in power output of up to ±20%. This shows that large errors 
up to 20% can be generated by using an lEC power curve for power estimation / modeling. The 2h sample shown in 
figurej^is overlaid in figurej^ illustrating the volatility of the conversion process. We also observe that the conversion 
process u{t) P{t) is a highly dynamical process that portrays well how the wind farm functions. We promote the 
two variables u(t) / P{t) as input / output for the wind farm, and present a stochastic model for the conversion 
process {u{t),(j)) P{t) in the next section. It should be noted that from now on, the analysis is systematically 
conditioned on the wind sector k defined by the ten-minute average direction ^ 


III. MODELING THE CONVERSION PROCESS OF A WIND FARM 


We propose in this section a stochastic approach to model the conversion process of the wind farm (w(<), 0) —)■ Pit). 
We extend a stochastic model originally developed for wind turbines |32j to an entire wind farm, based on observations 
in section HD This method converts the wind speed / direction signals into a stochastic signal of power output, after 


a proper parametrization during a training period. 

Three variations of the stochastic method are presented, see table |T] Model 1 (estimated) generates a power output 
signal Piit) at IHz using a wind speed signal u(t) at IHz. The parameters of model 1 are estimated directly during the 


While we have access to the effective wind direction 0(t) at IHz, our results show that the ten-minute average </> is a more representative 
measure. Wake effects are expected to be slow over the farm size of several km, explaining why we favor cj> over the fluctuating direction 




















6 



20 40 60 80 100 120 

time [min] 



o 

o 


o 

^ 00 


FIG. 5. Two-hour excerpts of (upper) wind speed u{t); 
(lower) power output measurement P{t). The gray lines in¬ 
dicate the IHz signals, and the black lines represent the ten- 
minute averages u and P. One can note the strong correlation 
between the wind speed and power output. 


FIG. 6. Measured data {u{t)-, P{t)} (black trajectory) at a 
sampling frequency of IHz for sector k = 8. The 2-hour 
excerpt from figure p is overlaid (gray trajectory). The lEG 
power curve of figure 3 is displayed for this sector (red curve 
with error bars). 
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training period, see subsection 


III A 


Model 2 (parametric) functions similarly to model 1 except that the parameters 
are no longer those estimated from the training period but a parametric version of them, see subsection |III B[ Model 
3 (parametric 10-min) extends model 2 by including a pre-model that first models a IHz wind speed signal from 
ten-minute wind data, then follows instructions from model 2, see subsection IIIC Model 3 can be used when only 
ten-minute wind data is available. Additionally, a fourth model based on the lEC power curve is used to compare 
to the stochastic models 1-3. All power output signals (one measured and four modeled) are compared in subsection 

mmi 


SIGNAL 

INPUT 

OUTPUT 

wind speed 

wind direction 

power output 

reference (measurement) 

measured u{t) 

measured 10-min cf> 

measured P(t) 

model 1 (estimated) 

measured u{t) 

measured 10-min p 

modeled P\{t) 

model 2 (parametric) 

measured u{t) 

measured 10-min cf> 

modeled P 2 {t) 

model 3 (parametric+lO-min wind) 

measured 10-min u 

measured 10-min p 

modeled Pait) 

lEC power curve 

measured u{t) 

measured 10-min p 

modeled PiEc{t) 


TABLE I. Overview of the different models presented. Each model converts the wind speed and wind direction input signals 
into an output power signal. 


A. Stochastic estimation of the conversion process 

In this subsection, the model parameters are estimated directly from data measured during the training period. 
This approach is the direct extension of the wind turbine model presented in |32) . We observed in figure that 
the IHz trajectory in {m, P} space fluctuates around the lEC power curve PjEciu, (p). Simply put, turbulent wind 
fluctuations drive the farm dynamics away from the power curve, that represents the average behavior aimed by the 
wind farm. Ref. [531IM] recognized this property for single wind turbines and proposed to describe these dynamics 
with a relaxation model towards the power curve. The power curve is simply seen as an attractor, around which the 
system fluctuates. 

Such dynamics can be reproduced by a stochastic drift / diffusion model. In this paradigm, the attractor towards 
some hypothetical power curve P{u) is represented by a drift coefficient while a diffusion coefficient models 
additional fluctuations. These two coefficients (also called Kramers-Moyal coefficients in stochastic theory) can be 
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estimated directly from measured data [351ISS] collected during the training period. They describe the dynamics of 
the power output P{t) conditioned on the IHz wind speed u{t) and the ten-minute average wind direction (j). The 
system evolves in a three-dimensional space with variables P{t), u{t) and The drift and diffusion coefficients must 
then be defined as the three-dimensional arrays 


D^^HP{t),u{t),^) 


^ lini 
n! T ->-0 T 


1 dM<^^\P{t),u{t),(l),T) 
n\ dr 


T—O 


(5) 

( 6 ) 


with n = 1 for the drift coefficient and n = 2 for the diffusion coefficient. As shown in Ref. m, they are the partial 
derivatives with respect to the time increment r of the conditional moment^ 


OO 

= J [P(t-br) - P(t)]” f{P{t + T)\P{t),u{t),^) dP{t + T) 

— OO 


= ([P(t + T)-P(t)] 


P{t) , U{t), 


(7) 

( 8 ) 


where the operator {A\B) represents the conditional mean of A for condition B. The conditional moment {P{t),u{t), (j>, r) 
is the n-th moment of the conditional probability f{P{t + T)\P{t),u{t),^) of the power increment P{t -I- r) — P{t) 
conditioned on the values of P{t), u{t) and 0. 

It is not possible to calculate the derivative in equation ([^ exactly from measured data. The time increment t 
is limited to the smallest value min(r) = !//«, where fs is the sampling frequency. In our case, min(T) = Is, and 
the limit r —>■ 0 cannot be performed. However, a Taylor expansion of the conditional moments gives in hrst-order 
approximation 


= 0) -fr ■ 

dr 


+ o{t^) 


r=0 




dr 


-f o(t2) 


(9) 


( 10 ) 


r=0 

= nW ■ + o{t^) , (11) 

where * stands for the variables P{t), u{t) and 0. The relation = 0) = 0 is obvious from equation For 

small non-zero r values, the derivative in equation (p| becomes to a first-order approximation 






( 12 ) 


Additionally, the discrete nature of measured data has further implications on the applicability of theories developed 
for continuous stochastic processes. It was observed by [3H1 [33] that when estimating the coefficients from a dataset 
that is not sampled with a high enough sampling frequency, an artificial parabolic term is add ed to the diffusion 
coefficient. A simple ansatz exists in order to remove this artifact from where equation (121 is used with a 

second conditional moment modified as |33! 


M^^HPit),u{t),<j3,T) = [P{t + t) - P{t) - T 


P{t) , u(t), 0^ . 


(13) 


This approach is used systematically in this paper in order to minimize finite sampling artifacts. 

Concretely speaking, the three measurement signals P(t), u{t) and 0 are collected during the training period and 
sorted in the three-dimensional phase space {P, it, 0}. Each axis, e.g. the P-axis is split into Np equidistant intervals 
of size AP. The phase space is then cut into Np x x bins (small cubes) of size {AP, Ait, A0}. Let us 
consider one bin {i,j, k) which contains all the data samples that satisfy P{t) G {Pi ± AP/2), u{t) G {uj ± Ait/2) and 
0 G (0fe ± A0/2). From all these data samples, the conditional moments (P^, itj, 0^, t) are calculated following 
equation ([^. Then the coefficients P*^"^(Pi , Uj^ (l)k) can be estimated from the conditional moments following equation 


This definition of the coefficients based on a derivation allows for a more robust estimation in the presence of measurement noise that 
spoils the data. 













(12). This operation is then repeated for each of the Np x x bins, such that the drift and diffusion coefficients 

can be estimated in the entire three-dimensional phase space Here we choose Au = 0.5 m/s, = 30° and 
AP = Pr/50 = 2%. 

The binning procedure allows to characterize the dynamics locally in the phase space. The information is averaged 
following equation Q in each local bin (rather than in time like the lEC procedure). This means that in each bin of the 
power-speed-direction space, the power dynamics are described by the drift and diffusion coefficients. In bin (i,j, fc), 
the drift coefficient D^^\Pi represents the first moment, i.e. the mean value of the power change = 

limT-_>o (*) calculated from all data samples in the bin. Similarly, the diffusion coefficient D^^^{Pi,Uj,4>k) 

represents the variance of the power change. In summary, and represent the mean and variance of the 
power change ■ They quantify how much the power changes on average (drift), and how much deviation is 
expected from that average change (diffusion). Although not mathematically exact, one could conceptualize the 
coefficients as 


Pi,Uj,(j)k), (14) 

((^ 

The drift coefficient is presented in figure]^ In bin if > 0 the drift is represented by a blue arrow 

pointing up because a positive drift means a power increase. Similarly, if < 0 the drift is represented by a red 
arrow pointing down describing a power decrease. The dynamics of the wind farm appear clearly. In all bins located 
below the power curve, the drift is positive and the power increases (on average). Analogously, in all bins above the 
power curve, the power decreases as the drift coefficient is negative. A clear drift towards the power curve appears. 
The drift coefficient is a map of the conversion process, that extends the information of the ten-minute averaged lEC 
power curve to IHz dynamics. A trajectory is overlaid to the drift coefficient in figure to illustrate the concept of a 
drift towards the attractive power curve. The drift coefficient is shown in figure |^a) for one wind speed / direction 
bin and confirms what was observed in hgure 

One can also define the potential of the drift coefficient as 




dt 


Pit Uj , 


(15) 


D^^\PpUj,(j)k) 


IdPii) 


dt 


fP 

T(P,M,,^fc) = - / D^^\P',u,,^u)dP'. (16) 

J —OO 

The potential is a more intuitive representation of the dynamics, see figure [^b). The system tends to minimize its 
potential by drifting towards an attractive fixed point (that corresponds to a local minimum of the potential). Around 
the attractive fixed point, the potential climbs to indicate the repulsion of the system to move away from the power 
curve. However, the system keeps following the non-stationary wind condition (changing j, k bins) and always drifts 
towards a new fixed point. 

Beyond the drift coefficient, the power values towards which the system drifts are of interest. We define this set of 
attractive fixed points as a Langevin power curve Pp^Uj, (j^k) following 




' dDP'>(P,*) \ 

/P=Pi(*) 


< 0 


(17) 


where * stands for [uj, (f>k)- At a fixed point, the drift coefficient is zero. For the fixed point to be attractive, the drift 
must be decreasing (negative derivative with respect to P). On the contrary, a fixed point with increasing drift would 
be repulsive. The system drifts towards the Langevin power curve Ppiuj, (j)k), that represents the basin of attraction 
for the wind speed Uj and wind sector (j)^. The Langevin and lEC power curves have similar values for the data set 
measured here, see hgure 

The dynamics can be seen as a random-like trajectory circulating around the power curve. This is the signature 
of a process that contains both deterministic and stochastic dynamics. The true reason for the random nature of the 
process is not obvious, as wind turbines have deterministic control strategies. At this point, it is important to note that 
the power dynamics are driven by the wind speed dynamics. In an idealized case with a spatially homogeneous, laminar 


^ The estimation is only possible in regions of the phase space that were visited during the training period, i.e. sufficient events are in 
this bin so that the corresponding mean values can be estimated. The training period should then be long enough for the measurement 
to span all the regions of interest. 
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FIG. 7. d rift coefficient D^^\Pi , uj; (f>k) estimated from equa¬ 
tion ( |12[ | for the sector fe = 8 ~ 225°). The drift is posi¬ 

tive (blue arrows upwards) where the power output increases, 
and negative (red arrows downwards) where the power de¬ 
creases. Its stable Hxed points define the Langevin power 
curve PL{uj,<f)k) (black dots), that matches the lEC power 
curve PiEc{uj,4>k) (black line). A IHz trajectory is shown 
(green line). The wind speed bin j = 20 (u ~ 9.75m/s) is 
indicated (dashed gray line) corresponding to figure]^ 



FIG. 8. (a) drift coefficient D^^^{P, uf, cj)k)', (b) drift potential 
'i>{P,Uj-,(f>k) for wind speed bin j = 20 (tt ~ 9.75m/s) and 
sector fc = 8 (^ ~ 225°) (gray line with error bars in case of 
the drift coefficient). Symbols are superimposed for positive 
drift (blue arrows), negative drift (red arrows) and zero drift 
values (black dot). The zero drift and minimum potential 
values are indicated (horizontal dashed line), corresponding 
to the value of the Langevin power curve PL{uj-, (j)k). 


wind inflow, the system should react in a deterministic manner. If this spatially homogeneous flow would see its speed 
change everywhere, the drift field should describe the corresponding change in power output satisfyingly. However, 
this ideal case is not realistic because the wind inflow is turbulent, and contains spatial fluctuations. The wind farm 
does not react to the effective wind speed u{t) averaged over the whole area, but to the local wind fluctuations acting 
on the local blade elements of all the rotors, so to say the wind field U{x,t). The effective wind speed u{t) is a 
macroscopic estimate of the many microscopic influences driving the wind farm (analogously to temperature being a 
macroscopic estimate of thermodynamic fluctuations). When projecting the system dynamics over this macroscopic 
variable, the microscopic degrees of freedom become stochastic fluctuations. 

A macroscopic variable driven by many microscopic interactions can often be approximated by a low-dimensional 
Langevin process We can write a Langevin equation that describes the time evolution of the power output P{t) 
following 

where r(f) is a Gaussian-distributed noise that is uncorrelated, i.e. (r(t)r(t')) = 2S(t — t') and with a mean value 
(r(t)) = 0. The Langevin equation can be integrated numerically in the Ito sense [35] 

P{t + At) = P{t) + At ■ D^^\P{t),u{t), (j)) + \J At ■ D('^'>{P{t),u{t),(j)) ■ r]{t) (19) 

where rjit) is a Gaussian-distributed random variable with mean value (?7(t)) = 0 and variance (?7(t)^) = 2. The 


s It is a direct analogy to the motion of a macroscopic particle within a flow, that was described by Robert Brown in 1827 as Brownian 
motion, or a consequence of synergetic m- _ 

^ The noise term r(t) fluctuates much faster than that is considered constant during the integration step 

At. When integrating the stochastic Langevin equation following Ito definition, 0) • r(t')dt' = 

D(2) (P(t), ii(£), T{t')dt' . Ref. |36| shows that T{t')dt' = \/~Ki ■ v?(t). 
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assumption of Gaussianity can be quantified by higher-order Kramers-Moyal coefficients being zero, i.e. = 0 for 
n > 2 following Pawula theorem [35]. 

The structure of the Langevin equation indicates that the power output P{t) drifts towards the power curve following 
but stochastic fluctuations are superimposed using the stochastic noise r(t) amplified by the diffusion coefficient 
This way the wind farm dynamics captured by the drift / diffusion coefficients can be remodele d by solving the 
Langevin equation. Given any new wind speed signal u{t) and wind direction (f> as input, equation (19) generates a 
synthetic power output signal Pi{t). This serves as a model for the conversion process {u{t), (f)) Some results 

are presented in subsection |III Pj in comparison with other models. 


B. Parametric form of the stochastic model 


The drift and diffusion coefficients D^^\Pi ^Uj](j)k) estimated from equation ( |l2[ ) describe the dynamics of the 
conversion process. A parametric simplification of the coefficients is presented here in an effort to reduce the number 
of model parameters, see figure]^ 

We observe that for a given wind speed bin uj and direction bin the drift coefficient {P, (j)k) is a tolerably 

linear function of P. The stable fixed points are given by the Langevin power curve PL{uj,^k)\\so that the drift 
coefficient can be simplified as 

D^^\P,Uj,^k) - Oijk ■ {P - PLiuj,$k)) , (20) 

where ajk is its slopeThe drift slope ajk is a direct measure of how fast the wind farm can follow wind fluctuations. 
A more negative slope (steeper drift) corresponds to a faster reaction of the wind farm|^ 

Similarly for the diffusion term D^\P, uj, ^k)■, some parametrization can be performed. As illustrated in figure]^ 
the diffusion coefficient is almost constant, such that 


Np 


D^'^^P,Uj,(j)k) ^ Pjk = j^'^D^'^\Pi,Uj,cj)k) , 


( 21 ) 


where the constant f3jk is the diffusion coefficient averaged over the Np power bins. The mean diffusion term fljk 
quantifies the random nature of the power fluctuations. Higher values indicate stronger power fluctuations [] The 
estimation of Pjk is further optimized following appendix A 2 


All the values of a^k and /Sjk are presented in figure llOj Both terms depend only weakly on the wind direction, 
especially at lower wind speeds where wake effects might be less pronounced, ajk grows rather cubically, i.e. ajk oc u^. 
This indicates that the reaction time of the wind farm (the inverse of a) drops cubically with u, stressing that the 
wind farm changes power values much faster at large wind speeds, the diffusion mean jSjk oc u® to counteract the 
growing drift. At large wind speeds u > 13m/s, the diffusion term drops because the pitch mechanism of the turbines 
regulates the power output that fluctuates very little. It is important to note that a larger diffusion term stresses the 
greater need for a stochastic model. At large wind speeds, the stochastic model becomes better-suited than a simpler 
deterministic model because the diffusion term is large. This illustrates the flexibility of a drift/diffusion model, where 
the drift quantifies the reaction time (inertia) of the system, and the diffusion/drift ratio controls the strength of the 
fluctuations. 

The parametrization greatly reduces the number of model parameters. While the two estimated coefficients contain 
each Np x N^ x N^ values, their two parametric forms only requires N^ x N^ values each. This parametrization also 
gives more insight o n th e dynamics of the wind farm. The coefficients simplified with ajk and Pjk are used within the 
Langevin equation (18) to model the conversion process {u{t),(j)) —>■ P 2 {t). Some results are presented in subsection 
III D in comparison with other models. 

In addition, a further simplification of the model could be achieved given the fact that Pp oc for Pp < Pr. Then 
ajk — aoPp and fijk — PoPp, where the parameters oq and /3o depend weakly on the wind direction. If this weak 
direction dependence is dropped, the Langevin model can be further simplified to the two-parameter form 

^^=aoPp{u{t)) ■ (^P{t)-Pp{u{t)))+y%Pp{uit))-r{t). (22) 


^ As observed in figure the lEC power curve PiEci’^j^^k) is very close to the Langevin power curve Phi'^j ■,^k) for tho data set 
measured. For this reason, the lEC power curve could be used instead of the Langevin power curve in equation d20|) if necessary. 

i The condition < 0 must be fulfilled for the model to be stable. This condition corresponds to a converging drift that drives the 
dynamics towards a stable attractor, the Langevin power curve. 

^ For example, when the wind speed is around 12 m/s, ajk — —0.05/s. If the power output P is 20% lower than the ideal value Pl, 
equation J20k indicates that ~ ajk ■ {P — Pl) — 1%/s. When in such conditions, the power output changes on average by ~ 1% 

each second. 

^ Although Pjk quantifies the intensity of the diffusion coefficient D^‘^\P,Uj, ^k)i ^ measure for the variance of the fluctuations is t he 
ratio (Sjkictjk- That is, the competition between diffusion and drift determines the range of expected fluctuations, see appendix . 


A2 
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FIG. 9. (upper) drift coefficient D^^\P,Uj ,(j>k) estimated (symbols and dashed line) for various wind speed bins j (various 
colors) and sector k = 8 {<j) ~ 225°). The power curve value PiEc{uj,cf>k) is indicated (colored symbol +). The linear fit 
proposed in equation (20l is also drawn (bold colored line); (lower) similarly for the diffusion coefficient D^^\P,Uj,(j)k)- The 
mean value j3jk of the diffusion coefficient is represented (colored bold line) following equation (211. 


o 



FIG. 10. (left) slope of the drift coefficient ajk and; (right) mean of the diffusion coefficient Pjk for all direction bins k, as a 
function of the wind speed (bin j). 


The two parameters are oq — —(6.48 ± 0.25) x 10“"^ and /3o — (7.42 ± 0.21) x 10“® for the wind farm. 

The question of the dependence of these two parameters on the nature and size of the wind farm arises, and would be 
of interest for further studies. One should note that for better accuracy, (3o should not stay constant but be reduced 
for u > 13m/s to reproduce the effect of pitching. Results of the simplified model are not presented here. 
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C. Pre-modeling of turbulent wind fluctuations 


Both models 1 and 2 replicate the conversion process (w(f), 0) — P{t)- A practical application consists in predicting 
the power production when the wind condition is known. However, models 1 and 2 require a wind speed signal u{t) 
sampled at around IHz and a ten-minute mean wind direction 0. Yet practical applications summarize the wind 
condition to ten-minute (or even one-hour) averages, and the IHz information is not known. In order to overcome 
such limitation, one must be able to model the wind dynamics at IHz realistically from the ten-minute (or even 
slower) wind data. Such a model is presented in this subsection. 

We developed a stochastic model that generates high-frequency fluctuations of wind speed □ The model super¬ 
imposes high-frequency fluctuations to the ten-minute average, following the description of atmospheric wind speed 
data from [TOlIlT]. Simply put, the wind speed signal is decomposed as 

u{t) = u + a ■ u'(t) (23) 

= u{l+TI -u'it)) , (24) 

where u and a are respectively the ten-minute mean value and standard deviation of the wind speed u{t), and TI 
is the turbulence intensity TI = a/u. In this description, the slow wind dynamics are described by u and TI (that 
change every ten minutes), and the fast turbulent fluctuations are given by u'(t)|^ We observed that the fluctuations 
u'(t) extracted from the measured wind speed u(t) have a distribution reasonably close to the normal distribution 
The autocorrelation function of u' {t) is a rapidly decaying, exponential-like function 

Ru'u'(t) = {u'{t + T)u'{t)) ~ exp(-7T). (25) 

Such exponentially-correlated, Gaussian process is similar to the so-called Ornstein-Uhlenbeck process [5S] that is 
a special class of Langevin processes. The wind fluctuation signal u'{t) can be modeled as an Ornstein-Uhlenbeck 
process following 


du'{t) 

dt 


= -7u'(t)-f V7-r(t), 


(26) 


where T{t) is a Langevin noise, that was already introduced in equation (18). Equation (26) can be integrated 
numerically using the Euler scheme, see equation (19). 

In summary, we propose a simple model for the fluctuations u'{t) of the wind speed signal u{t) (representing the 
effective wind speed over the wind farm). One should be aware that turbulence is in general much more complex, 
as was shown e.g. by n-point statistics |42j . But here we show that this ansatz already works quite well for our 
particular application. While finer approaches can be developed, this simple method suffices here. u'{t) is modeled 
by the stochastic equation (26), where the unique parameter 7 is extracted from the measured wind speed following 
equation @0 The signal It'(t) is then used in combination with the measured ten-minute mean wind speed u and 
turbulence intensity TI following equation (24) to construct a synthetic signal of wind speed u{t) at IHz. When this 
wind speed model is combined with the wind farm model presented in subsection HI B we can model the farm power 
output Psit) at IHz knowing only the ten-minute mean wind speed, turbulence intensity and mean wind direction. 
Some results are presented in subsection |HI D| in comparison with other models. 


D. Statistical validation of the stochastic approach 


We compare the power output signals modeled to the reference power output P{t) measured. Three variations of 
our stochastic model are presented in subsections HI A |HIB and HI C yielding three power output signals T’i(t), H 2 (t) 
and P 3 (t), see table |T] A fourth model is tested using the lEC power curve directly to convert at IHz the measured 
wind speed u{t) into a power output PiEc{t) = PiEc(u{t)). The five signals are compared in identical conditions 
over the total period of the measurement campaign (54 days spread over a total period of eight months, during which 
all turbines operate). 


^ The lEC norm [7] defines wind speed fluctuations as a Gaussian field with spectral properties described by either a Kaimal or a von 
Karman spectrum. 

^ The question whether wind dynamics can be easily separated into small-scale (turbulence) and large-scale (meteorology) effects is still 
an open question. The historical yet controversial hypothesis of a spectral gap m implied such a separation. Some recent studies go 
beyond such simplification and present more complex descriptions such as e.g. multifractal cascade models. 

° The normal distribution is commonly observed in natural phenomena, including the wind speed of some turbulent flows. Yet this 
oversimplified description of turbulence is deceived by a more complex multifractal scaling and small-scale intermittency. Here we 
consider the effective wind speed u{t) averaged over the entire wind farm, whose fluctuations u'(t) are close to a Gaussian monofractal 
process, see also Ref. m- We believe that a random Gaussian model is in this particular case acceptable. 

P The decay parameter 7 should be estimated once for the given location of the wind farm as a measure of the local wind correlations. 
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Excerpts of the five signals are presented in figure [m The time period shown was chosen because it covers a 
wide range of power values (the time series are displayed as a first illustration, whereas the statistical analysis covers 
the entire measurement period). The power signal measured P{t) shows fluctuations at many scales and complex 
statistical features such as volatility clustering in the first five hours. Large power changes of about 50% are observed 
within minutes. These dynamics measured are well reproduced by the first two stochastic models for Pi{t) and 
P 2 it). The third stochastic model for Pz{t) also follows the slow trend with similar fluctuations, but shows some 
minor variations due to the fact that only ten-minute information of the wind was used (the wind fluctuations at 
1 Hz were modeled, see subsection III Cl. The lEC method for PiEc{t) yields mixed results, as the slow trend is 
reproduced, but fast fluctuations are strongly over-represented. These results can be explained easily: the first two 
stochastic models reproduce the measured features because they exploit IHz wind data and they model the wind 
farm dynamics, including its inertia; the third stochastic model also models the farm dynamics, but only has access 
to ten-minute wind data, thus the fastest fluctuations are synthetic; finally, the lEC method does not model the fast 
farm dynamics but only the long-term behavior, as expected from using an average curve. 
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FIG. 11. Excerpts of power output signals sampled at 1 Hz over a continuous time period of lOh. The measured signal P{t) is 
compared to the four modeled signals Pi{t), P 2 {t), Psit) and PiEc{t) (top to bottom). For each signal the vertical axis spans 
the range of power values from 0% to 100%. 


The probability density functions (PDFs) f{P) of the power output signals are presented in figure 12 Except 


for some minor deviations from the parametric results P 2 (t) and Psit), all the models span the same values as the 
measurement with equal probabilities. 

We present the cumulative energy produced as a function of time Ei{t) = Pi{t')dt' in flgure|l3|(£'(t) is calculated 
for the measured power P{t)). The four models produce some deviation of up to 9% in the first 4 days of the simula¬ 
tion, then these deviations drop rapidly over longer integration times. The lEC model produces the best results all 
along for long-term energy production, and matches mostly perfectly the measurement over the total simulation time 
of 54 days, with a deviation of only 0.02%. It is to be expected that the lEC power curve, being an average curve, can 
reproduce the total energy (i.e. mean power) of the data set from which it is estimated. It is followed closely by model 
1 (estimated), that only deviates by 0.3% of the total energy production over the total simulation time. Models 2 and 
3 show a slight deviation of 0.8% and 0.7% from the measurement over the simulation time. While the lEC power 
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FIG. 12. probability density functions (PDFs) f{P) over the 
total measurement period. f{P) was estimated using 50 bins. 


FIG. 13. Relative difference of energy production {Ei{t) 
E{t))/E{t) for the various models i = 1,2,3,/ifC' in [%]. 


curve remains optimal for long-term energy production, minor deviations are found using the estimated model 1. 
Larger deviations are introduced by the parametrization in models 2 and 3. The stochastic models show satisfactory 
results, considering that an integration over about 5 million stochastic samples deviates from the measurement at 
most by 1%. This is an important validation that the stochastic models follow the measurement over time without 
diverging (as tested tens of times for the three stochastic variations). This is due to the nature of the model, and 
particularly of the drift field that constantly drives the dynamics towards the power curve. Properly reproducing 
slow dynamics is a necessary (but not sufficient) condition for a model of the high-frequency dynamics. The integral 
power is a first-oder quantity of the power conversion process. Next we aim to present a deeper statistical analysis of 
the data. In particular we focus on two-time quantities, namely the power spectrum, the autocorrelation function as 
well as the probability distribution of power increments and its corresponding structure functions, for further details 
on the analysis scheme see Ref. m- 
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The power spectral density commonly referred to as spectrum |P(/)P is presented in hgure 
signals. Each spectral value \P{f)\'^ indicates the energy of the corresponding wave of frequency /. 


for the power 
The spectra 


can be described reasonably well by a power law f~^ (linear behavior in log-log scale). The three stochastic models 
have similar spectra, very close to a power law. They are close to that of the measurement, although the measured 
spectrum is more complex, with different scaling behaviors in various frequency bands. The lEC power curve model 
deviates from the measurement, having spectral values up to two orders of magnitude too high; it over-represents the 
intensity of the fluctuations for / > 2.10“‘^Hz, corresponding to fast fluctuations up to roughly 80 minutes. Although 
not shown here, all spectra match at lower frequencies / < 2.10“'*Hz, indicating that they all follow the trend of the 
wind dynamics at 80 minutes and longer time scales. 

The spectrum is related to the autocorrelation function Rpp (r) The autocorrelation functions of the power signals 
are presented in figure |15| The power output measured has long time power-law-like correlations. As an example, 
a power output signal remains correlated to 97% with itself when shifted by one hour. The three stochastic models 
match the measurement very well, indicating that they reproduce the time correlations of the measured power. They 
display power-law correlations close to that of the measurement, as already observed for the power spectral densities. 
In contrary to the good results of the stochastic models, the lEC power curve model deviates from the measurement 
and generates a signal that is less correlated. This deviation can be related to results of the power spectral density, 
where the lEC model was shown to deviate significantly from all the other signals. Particularly, the large drop of 
correlations at r < lOmin corresponds mostly to spectral deviations at / < 2.10“^Hz, as the low-frequency Fourier 
modes control the coarse-grained dynamics. 


^ The Fourier transform P{f) was calculated using the Fastest Fourier Transform in the West algorithm. The spectrum was then 
estimated using Bartlett’s method to obtain a denoised spectrum m- 
^ The autocorrelation function of a stationary signal x{t) is given by Rxxi^) = where ax ^ is the variance of x{t). 

Rxx{t) is the inverse Fourier transform of the spectrum \x{f)\^ of x{t), following Wiener-Khinchin theorem. Because the power signals 
we study are non-stationary, we calculated Rppir) over time windows of 8192s. 
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FIG. 14. power spectral density \P{f)\'^ for the five power 
signals, displayed in log-log scale. The frequency range is 


/e [2- 


Hz. 


FIG. 15. autocorrelation function Rpp{t) for the five power 
signals, displayed in linear-log scale as a function of the time 
lag T. 


Our final statistical test consists in comparing the PDFs of the power increment Pr(t) = P{t + t) — P{t)- The 
power increment FV(t) is the change (increment) of power output P{t) when waiting a time r. The PDF f{Pr{t)) 
indicates with what probability the power output can change over a time scale r[^ The PDFs of the power increments 
are presented in figure for four time scales. The power output measured has exponentially-distributed increments 
(linearly decaying tails in linear-log scale). This is typical for non-Gaussian, intermittent processes that have some 
quiet time periods alternating with some periods of strong fluctuations, as observed for the time series in figure [m 
This notion relates to the intermittent nature of the wind m that is converted into power intermittency by the wind 
farm |4]. At the shortest time scale available r = 1 s, the stochastic models reproduce the increment PDF of the 
measurement mostly well, with a slight over-representation of large power increments for model 1. Model 2 and 3 
match the measurement. The lEC procedure generates much too large power fluctuations at this time scale, with 
increments up to ±20% within one second, when the measurement can change by at most ±5%. At the time scale 
T = 10 s, the stochastic models under-represent large power increments and the lEC procedure still largely over¬ 
represents them. The measured power output can change by ±20% within ten seconds. At the next time scale r = 60 
s, the stochastic models match the measurement very well, except for slightly too many extreme events at Pr > 30%, 
and the lEC procedure comes closer to the measurement. Note that for these strong fluctuations the probabilities are 
obtained from 10 and less events per bin, thus error bars larger than 30% are expected. The measured power output 
can change by ±30% within one minute. At the time scale t = 600 s, the four models match the measurement very 
well, except for the lEC procedure that still slightly over-represents extreme events. The measured power output can 
change by ±60% within ten minutes. We want to stress that the probabilities also show that more extreme events 
are expected when longer data sets are considered. 

Note, a power fluctuation of 5% corresponds to a change of more than IMW here. Fluctuations of several MW may 
effect the local grid stability essentially. The non-Gaussianity of the increment PDFs tells us that even more extreme 
values become likely over longer time periods. This is important for grid stability and puts additional constraints on 
the design of back up methods such as energy storage or overload security. Power gustiness is an important feature of 
wind power production. Rapid power changes affect grid stability, especially on networks that rely strongly on wind 
energy. When considering the entire dataset, we observe power changes of up to 60% within 3 minutes, and up to 
85% within 10 minutes. This stresses the importance of power intermittency on grid stability, and how necessary it 
is to properly model it. 

We promote the three stochastic models that generate some power gusts that have a similar amplitude to that of 
the power gusts measured. These gusts also occur with similar probabilities. Thus our model time series are clearly 
better than the time series obtained from the lEC procedure, which strongly exaggerates fast power gusts in time 


® The second-order 2-point moment < Pr{i)‘^ > corresponds to the autocorrelation and the power spectrum. The PDFs /{FtP)) 
incorporate information about all the higher-order 2-point statistics < P-rit)" >= J PtPP/{P^ dPr- This allows for a thorougher 
validation of our models. 
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scales of seconds. For time scales of ten minutes and more both methods become similar. This confirms that the lEC 
procedure is valid for slow dynamics, and fluctuates too strongly for fast dynamics within one hour. 





FIG. 16. increment PDFs /(FV(t)) at various scales r for the five power signals, displayed in linear-log scale. 

Gustiness and intermittency are related to the scaling of the increments Pr- We define the structure functions 
following [33] 

= (27) 

The power law scaling Sq{T) ~ is observed for all the power output signals, indicating their fractal nature. 
Additionally, the scaling exponent Cg indicates the nature of the fractality N A linear dependence (^q oc. q indicates 
a monofractal signal with self-similar features along scales, implying that the form of the PDFs f^Pr) would not 
change. In this case only the r-dependency of the variance would be needed to characterize the statistics. If C,q is 
instead a nonlinear, concave function of g, the process is multifractal with an intermittent, walk-like behavior at small 
scales and a noise-like behavior at large scales. Consequently we need to characterize /(TV) for all t. The scaling 
exponents are presented in figure compared to Kolmogorov’s 1941 monofractal and 1962 multifractal models for 
ideal local isotropic turbulence. The measured power lies in between the two models, being intermittent but less 
than K62. Models 2 and 3 reproduce very well the multifractality of the measurement. Model 1 is too multifractal, 
probably owing to the (still too) quadratic form of the diffusion coefficient estimated. We observe that model 3 (that 
uses Gaussian wind fluctuations) is multifractal, showing that power intermittency for the wind farm comes from 
the conversion process, rather than from the intermittency in the wind. The lEC model is less multifractal, as the 
small-scale intermittency introduced by the inertia of the wind turbine rotors is not reproduced. 


^ More precisely, we use the extended self-similarity method from Ref. m to remove a slightly anomalous scaling observed in the structure 
functions. This method extracts the scaling exponents from the relation S'g(r) ~ , forcing ^3 = 1. 
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FIG. 17. scaling exponents (^q as a function of the order q 
for the five power signals, calculated for scales in the range 
r £ {256,16384} s. K41 (dotted line) and K62 (dashed line) 
are given for reference. 
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FIG. 18. flatness F{t) for the five power signals, displayed 
in log-linear scale as a function of the time lag r. F = 3 
indicates a Gaussian process (horizontal dotted line). 


The flatness (or kurtosis) F{t) = [13] quantifies directly the non-Gaussian character of the distribution 

see figure [I^ We measure a high flatness up to a value of 12 at short time scales in seconds confirming 
the strong intermittency at short time scales. This value is reduced by a factor two within two hours, and reaches 
the Gaussian limit F = 3 for r > 3 days Model 1 matches the measurement moderately well for r < 100 s, 
and models 2 and 3 deviate more at those scales. At larger scales r > 200 s, the three stochastic models match the 
measurement very well. The lEG model is not intermittent enough for r < 3 hours, then matches the measurement 
at larger scales. 


IV. PERFORMANCE MONITORING 


After characterizing and modeling the power output based on the wind speed, the stochastic approach has a second 
important application for the overall monitoring of the wind park performance based on IHz data. Performance 
monitoring consists in tracking the power performance of the wind installation. A monitoring procedure verifies 
in real time whether the installation is running in its optimal configuration, or if maintenance is needed to fix a 
potential anomaly. Many components of e.g. aerodynamic, mechanical or electrical nature interact in complex ways 
in a wind turbine. A detailed monitoring of each single component yields a huge amount of data to analyze in real 
time. This microscopic monitoring can be advantageously simplified to a macroscopic procedure, where a set of global 
estimates describes the overall system. This reduction of the many variables into a set of global estimates is even 
more meaningful for a large array of wind turbines such as a wind farm. 

The reduced amount of information must be balanced by insightful estimates that properly extract the essential 
information. These estimates must be sensitive enough to detect anomalies, yet robust enough to cope with the 
expected fluctuations. The w ind s peed/power output signals are commonly displayed together as a power curve. The 
lEG power curve (see section II C| ) gives a rapid overview of the overall performance of the conversion process tt —>■ P 
for a single wind turbine. The extensive averaging applied greatly reduces the amount of data, but seriously lessens 
the sensitivity to fine changes. Based on our analysis above, the lEG power curve is expected not to be an op timal 
estimate of the dynamical performance. Instead, we promote the drift coefficient D^^\P,u;(j)) (see section III A) as a 
flexible and accurate alternative. 


“ It is interesting to note that the correlation length of some atmospheric wind datasets was found to be around 4 — 5 days in 12611461 . 
matching our observation for the wind farm output. The autocorrelation function shows Rppp = 4days) ~ 0.25 and Rppp > 
15days) ~ 0. 

'' A local peak is observed for the flatness of the measured data for t ~ 10 minutes. This time scale corresponds to the characteristic 
size of the farm of about 3 km. It could be due to the fact that the farm converts atmospheric structures at this scale without filtering 
them, yielding a local increase in intermittency. This feature is not reproduced by any of the models. 
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Our goal here is to test the sensitivity of both the lEC and the stochastic estimates, see resp. subsections IV A 


and IV B[ First we want to find out how long an anomaly must last, so that it is surely detected. In particular, the 
progressive modification of the estimates is studied in relation to the anomaly duration for both methods. As a second 
aspect, the impact of continuous (uninterrupted) anomalies as well as intermittent anomalies (in the sense that they 
only last during short time periods, here of 1 min) is analyzed. 

A dataset is selected within the wind sector fc = 8 (^ ~ 225°) spanning a discontinuous time period of 14 days, 
when all 12 turbines function properly. For each anomaly, we artificially shut down one or two wind turbines (set 
their power output Pi{t) = 0) during a given duration and recalculate the cumulative farm power. We developed 
around 50 anomaly scenarios and tested their impact on the IFC and stochastic estimates. Four of these scenarios 
are selected here to illustrate the two central aspects: how long must the anomaly last; and how robust are the IFC 
and stochastic methods to anomalies occurring intermittently during short time periods? A proper definition of the 
four scenarios is presented in table |IT] 


anomaly 

description 

shut down 

anomaly duration 

normal operation 

anomaly type 

(a) 

lshutdown 12h 

1 turbine 

12h (3.6%) 

13dl2h (96.4%) 

continuous 

(b) 

lshutdown 4d 

1 turbine 

4d (28.6%) 

lOd (71.4%) 

continuous 

(c) 

1 shut down 4d Tnt 

1 turbine 

4d (28.6%) 

lOd (71.4%) 

intermittent 

(d) 

2shutdown 7d 

2 turbines 

7d (50%) 

7d (50%) 

continuous 


TABLE II. overview of the four anomaly scenarios implemented over a 14-day time period. For each scenario (a)-(d), a 
description is given, as well as how many of the twelve turbines are shut down in the wind farm. The duration of the anomaly 
and of the normal operation are given in time (and in percent). Finally the anomaly type is given, whether continuous in 
time (at the end of the 14-day period) or intermittent (5760 one-minute-long anomalies are spread over the total time period, 
representing a total anomaly time of 4 days). 


We monitor the cumulative farm power output for each scenario, see figure and compare it to the measured 
(normal) farm output. It is important to note that the wind/power conditions constantly change over the 14-day 
period. The four anomalies cover different time periods, such that each scenario affects the power performance at 
different locations in the {u;P} space. It makes it difficult to compare different scenarios from one another. Instead, 
we separately compare each scenario from the measured normal case and a full anomaly case where the shut down 
(of one or two turbines) occurs during the entire 14-day period. 
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FIG. 19. Wind farm power output Pit) for the wind sector fe = 8 (^ ~ 225°) for scenarios (a-d) (left to right). In each anomaly 
scenario, the power output (black line) is compared to the power output measured (gray line in background). The period of 
the anomaly is marked by the gray area (except for (c) where the anomaly happens intermittently all over the 14-day period). 


A. Monitoring procedure based on the lEC power curve 


We compare in figure 20 the impact of each anomaly on the ten-minute means of power output P and on the 
corresponding IFC power curve PiEc{uj,4>k)- We observe that for scenario (a), most ten-minute means lie around 
the power curve of the normal case, and the resulting IFC power curve in figure [2^a2) does not deviate noticeably 
from the normal case. The anomaly only lasts 3.6% of the total time, which is not enough for the IFC procedure to 
detect it. For scenario (b), enough ten-minute means deviate from the normal power curve, such that the IFC power 
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curve is changed significantly by a step-like structure at wind speeds higher than 10 m/s. We found that this is the 
minimal anomaly duration, i.e. 28.6% of the time for the lEC method to detect an anomaly. In scenario (c), the 
anomaly also lasts 4 days but intermittently. In this case the ten-minute means deviate only slightly from the normal 
case, and no outliers are observed. The resulting lEC power curve does deviate slightly from the normal case, but less 
so than for the equally long but uninterrupted anomaly in scenario (b). This is expected from a ten-minute averaging 
that mixes 9 minutes of normal operation with 1 minute of anomaly. This is a limitation of the lEC procedure, that 
cannot separate scenarios alternating faster than ten minutes. Einally for scenario (d) two turbines are shut down 
during 50% of the time. Half of the ten-minute samples are outliers that deviate clearly from the normal case. The 
lEC power curve lies in between the two states, as each state takes place half of the time. The anomaly is clearly 
detected. 










FIG. 20. (upper) ten-minute means of farm power P versus wind speed u (dots) and; (lower) lEC power curve PiEc{uj,<j>k) 
(bold black line) for the wind sector fc = 8 (^ ~ 225°) for scenarios (a-d) (left to right). The lEC power curves are also given 
for the measured normal case (full gray line) and for the full anomaly case (dashed gray line). 


B. Monitoring procedure based on the drift coefficient 


The drift coefficient (P, Uj, (j)k) presented in section III A quantifies the conversion dynamics of the system to 


a first order approximation. It is a good candidate to detect fine changes in the farm dynamics. Additionally, its 
potential 'I'(P, Uj, 0^) gives a more intuitive representation of the attractors of the system, that define the Langevin 
power curve PLiuj,4>k)- 

We compare the impact of each scenario on the drift coefficient, potential and Langevin power curve in figure 
As seen in figure]^ the measured normal case (full gray line) corresponds to a linear drift towards the Langevin power 
value. The extreme case of a full anomaly (dashed gray line) also displays a fairly linear drift towards a new, reduced 
power value. Shutting down some turbines reduces the power production, and the farm naturally drifts towards a lower 
power value. In the four scenarios, the normal and anomaly situations coexist, and the drift coefficient systematically 
records these two states. The anomaly duration acts as a weighting factor that determines how much the drift is 
distorted close to the reduced, anomalous power value. For scenario (a), 3.6% anomaly time is sufficient to modify 
the drift coefficient and potential noticeably at power values P < 95%. We observe for scenario (b) that a longer 
anomaly time of 28.6% brings new fixed points at P ~ 95% in the rated power region. The drift coefficient displays 
multiple stable fixed points, and the potential has several local minima. The intermittent anomaly in scenario (c) is 
also detected in the drift coefficient and potential. Less fixed points are created in the Langevin power curve, due 
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to the fact that the anomaly scatters over the entire state space {u; P}. It is not as localized in the state space as 
the continuous anomaly in scenario (b), so that it affects a larger region of the state space, but to a lesser extent. 
Finally for scenario (d), both dynamical states are recorded by the drift coefficient that clearly detects the anomaly 
superposed to the normal operation. 
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FIG. 21. (upper) drift coefficient D^^\P,Uj,4>k) and; (middle) drift potential 'i>{P,Uj,(j)k) for the wind speed bin j = 28 
(u ~ 13.75m/s); (bottom) Langevin power curve PL{uj,(f>k) for the wind sector k = 8 {(f> ~ 225°) for scenarios (a-d) (left to 
right). In each plot, the anomaly case (bold black line / black crosses) is compared to the normal case (gray line / full gray 
dots) and to the full anomaly case (dashed gray line / open gray dots). The drift potential is arbitrarily shifted upwards for 
comparison (minimum set to zero). In the (lower) figures, the dotted vertical line indicates the wind speed bin j = 28. 
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At last, the intermittent anomaly is considered again, yet this time the duration of the anomaly is varied, see figure 
One sees that when the intermittent anomaly only takes place 6 hours (1.8%) or less, the potential does not 
change. Yet if the intermittent anomaly occurs over a duration of 12 hours (3.6%) or longer, the potential clearly 
deviates from the normal case. The deviation increases with increasing duration, as the potential approaches the 
full anomaly case. This type of dynamical behavior is reminiscent of phases transitions. It is similar to the cusp 
catastrophe described in catastrophe theory, see e.g. Ref. [47]. Thus, the potential follows the generic law of the 
form 'F(P) = P'^ + aP^ + bP, where a causes the bistability of the system. For a bistable system, the slope of the 
drift coefficient changes from negative to positive to negative again (instead of being only negative in the presence of 
only one attractive fixed point). In our case, we see how the parameter b causes the phase transition by shifting 
along the //—axis and also causes the transition from one to two stable fixed points. This kind of bifurcation caused 
by the weighted mixing of two phases was also observed, e.g. for the differential resistance of some nonequilibrium 
semi- and super-conductor systems [15]. 
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FIG. 22. Drift potential 'i/{P, Uj, (j>k) for the wind speed bin j = 28 (u ~ 13.75m/s) and wind sector fc = 8 ~ 225°) for an 

intermittent anomaly of varying duration (see legend). The drift potentials are given for the measured normal case (full red 
line) and the case when the anomaly lasts during the total 14-day period (dashed red line). The curves are arbitrarily shifted 
upwards (minimum set to zero) for comparison. 


C. Summary 

In summary, we conclude from the sensitivity analysis that the drift coefficient is a better estimate than the lEC 
power curve to monitor the power performance of the wind farm. It reacts faster to a change in the farm dynamics 
than a procedure based on ten-minute averaging. This comes from the fact that the lEC method gives an average 
result that deviates slowly. In contrast to this the drift estimate can detect the bistable dynamics, which provides 
much more insight onto the cause of the anomaly, as an emerging bistability clearly indicates that something does 
not function properly. We observe in scenario (a) that the drift coefficient of the entire farm changes significantly 
when one of the twelve turbines is shut down during 3.6% of the time. Scenario (b) illustrates that the same turbine 
must be shut down during at least 28.6% of the time for the lEC method to detect it. Additionally, scenario (c) 
shows that if the anomaly switches the turbine on and off intermittently, the lEC power curve can no longer detect 
it affirmatively. The drift coefficient detects the intermittent anomalies as well as the continuous ones, in both cases 
the anomaly needs only last 3.6% of the time to be detected. For this reason, we promote the drift coefficient and 
the corresponding Langevin power curve as improved macroscopic estimates for wind farm performance monitoring. 


V. CONCLUSION 

The conversion process of an onshore wind farm is analyzed at a sampling frequency of IHz. Clear dynamics appear 
when considering the effective wind speed u and ten-minute average direction (j) averaged over all wind turbines, as 
well as their cumulative power production P. Fast power fluctuations of up to ±20% around the power curve are 
measured continuously, stressing the need for a more evolved description. Feature-rich dynamics are observed at the 
time scales of few seconds where the wind farm reacts to wind fluctuations. We propose a new approach where this 
conversion process (u(t), (j)) P{t) is modeled as a stochastic Langevin process. Two model coefficients are estimated 
from the data set measured, and describe intuitively the wind farm dynamics: the drift coefficient describes the 
attraction towards an attractive power curve, and the diffusion coefficient represents additional turbulent fluctuations. 
In addition, we introduce an additional pre-model of wind speed fluctuations, such that only ten-minute wind data 
is needed to model the power output at IHz. A statistical analysis confirms that the Langevin model reproduces the 
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complex properties of the measurement well, including its intermittent features. Such high-frequency dynamics cannot 
be modeled correctly using a power curve method, that only describes the long-term dynamics. Beyond predicting 
the trend of the wind power production, our model predicts realistic power fluctuations at IHz, so that the impact 
of wind fluctuations on the grid stability can be forecast. Also, the added insight helps understand the dynamical 
response of large wind installations to wind fluctuations, such that smart grid concepts can be further refined. Thanks 
to their general flexible structure, such stochastic methods can be upscaled easily in order to understand and model 
the large wind installations that will populate future electric networks. 

As a second application, we show that the drift coefficient is a compact measure of the global performance of a wind 
installation, making it a powerful tool to monitor the power performance of a wind farm. It captures more information 
about the dynamical behavior of the wind farm than the lEC power curve. For example, the drift coefficient detects 
a significant change in the global wind farm dynamics if one of twelve turbines is shut down during 4% of the time, 
whereas an anomaly time of almost 30% is needed for the lEC power curve to detect it. The lEC power curve is 
also at a disadvantage when in presence of anomalies that intermittently switch off a turbine during a short time, 
because the time averaging it applies strongly reduces the sensitivity to dynamical changes faster than ten minutes. 
On the contrary, the drift coefficient is equally reactive to intermittent and continuous anomalies. Such stochastic 
analysis can be useful for the real-time monitoring of wind farms. It can be applied in addition to existing tools to 
give some first information about the entire farm. If a potential anomaly is detected, the stochastic analysis can then 
be performed in greater detail for single wind turbines. If an anomaly still appears then, a full (microscopic) analysis 
of all the turbine sensors is justified. Also here, a stochastic analysis may be helpful too for damage detection, see 
Ref. [15]. Our approach is fruitful as a first tool to alert of an occurring anomaly. 
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Appendix A: Probabilistic description of the wind farm power production 


1. The Fokker-Planck equation 


A stochastic approach was presented in section in to generate time series of power output for the wind farm. 
The stochastic signals are generated by solving a Langevin equation in time. The Langevin process generated 
is a stochastic drift / diffusion process. The drift coefficient D^^\P,u, tp) and the diffusion coefficient {P,u, (p) 
approximate to a first and second order how the wind farm converts a wind field {u{t), (p) into a power output P{t). 

The Langevin equation being by nature stochastic, the process simulated P{t) has a random character. This means 
that if the same Langevin equation is solved N times with identical wind conditions, the new signal simulated each 
time will always be different to some degree, owing to the random nature of the Langevin noise T{t) in equation (18) 
(but its statistics won’t change). While the value of P{t) at a given time t is random, the probability f{P, t) of having 
a power value P at time t is fixed. The probability density function of the Langevin process P{t) at any given time t 
is uniquely defined as the solution of the Fokker-Planck equation [53 [55] 


\ dP{t)'^ > (Al) 

where the Kramers-Moyal coefficients are estimated following equation (§. The Fokker-Planck equation is the equiv¬ 
alent in the probability domain of the Langevin equation in the time domain. 

If the probability f{P,t) is desired instead of simply one sample value P{t), the Fokker-Planck equation should be 
solved over time]^ An example is presented in figure where the Fokker-Planck equation is solved at IHz using as 


We solve the Fokker-Planck equation using the path integral method presented in Ref. ESj. Given an initial condition f{P,t = 0), we 
calculate f{P,t > 0) by repeatedly solving the equation f{P,t -h dt) = f f(P,t dt\P\t}f(P\t)dP' over an infinitesimal time step 
dt, where the conditional probability f(P,t dt\P',t) is completely described by and One can visualize how the drift and 

diffusion coefficients concurrently contract and stretch the PDF over time. 
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initial condition a Dirac distribution f{P,t = 0) = S{P{t = 0)). We see at six exemplary times t what the measured 
power value P{t) is, and the probability f{P,t) estimated by the Fokker-Planck equation. 


X P(t) 

— f(Rt) 



^ 360 ^ 560 

time t [s] 


FIG. 23. PDF f{P, t) (green curve) at times t = 
[100, 200, 300,400, 500, 600]s calculated from equation 
The power value measured at those times is indicated (black 
cross). The power output measured P{t) is also shown (gray 
line). 



FIG. 24. Histogram of the difference 5P = max f{P, t) — P{t) 
estimated over 36000 samples (lOh). 


The difference SP between the most probable power value given by the Fokker-Planck equation max f{P, t) and the 

SP is rather centered 


measurement P{t) is calculated for 36000 samples, and its histogram is presented in figure 24 


around zero, has a mean value —0.11% and a standard deviation 2.54%. The standard deviation of P{t) is equal to 
CTp = 22.63%, that is 9 times as large as the standard deviation of the difference. This means that the most probable 
power value predicted by the Fokker-Planck equation max f{P,t) deviates from the measurement by only <Jpl9 on 
average. This concludes that the PDF simulated by the Fokker-Planck equation follows the measurement faithfully. 


2. Probability-based optimization of the Kramers-Moyal coefficients 


The drift coefficient D^^\P,u^4>) and the diffusion coefficient D^‘^">{P,u,(j}) are d efined in each bin of the three- 
dimensional phase space {Pi, A parametric form is presented in subsection 


lllB 


where in each sub-domain 


{uj,(j)k}, the drift is simplified to a linear function {P,Uj,4>k) — cejk ■ (P — PL{uj,4>k)) and the diffusion to a 
constant D^'^^P, Uj, (pk) — Pjk- The two parameters ajk and jSjk are fitted from the drift and diffusion coefficients 



power output P [%] 


FIG. 25. PDF of measured power f{P\uj,4>k) (dots) and PDF fstiP) generated by model 2 (dashed line) and model 2 optimized 
(bold full line) for various wind speed bins j £ [9, 27] (various colors) for the wind sector fc = 8 (0 ~ 225°). 
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estimated. However, some artifacts such as low data sampling or the presence of measurement noise within the data 
affect the estimated Kramers-Moyal coefficients, which then deviate from the true coefficients of the system. We 
propose an optimization procedure that corrects this discrepancy by refining the estimation of and 

As long as the system stays inside the given sub-domain {uj,^k}, the local dynamics are dictated by the local values 
of drift and diffusion. During that period of time, the dynamics do not change and the system fluctuates around the 
power curve value PL{uj,^k)- This corresponds to the PDF f{P,t) relaxing to a stationary form fst{P) that is given 

by 


giving 


dfstjP) 

dt 


fstiP) 


D(^){P,u,^) D(^'){P',u,^) 


(A2) 

(A3) 

(A4) 


where A^ is a normalization constant such that fst{P) dP = 1. fst{P) is the stationary solution of the Fokker- 
Planck equation, that corresponds to the stationary case where and do not change in time. In the limit of 
a system that reacts infinitely fast to the changing wind condition, one can consider that between two changes of the 
wind condition, the system has time to relax to its stationary state and wait for the next change. We assume that 
our system is reasonably close to this limit, owing to the low dispersion of the data around the power curve (see figure 
1^. This means that all the data {P\uj,(l>k) contained in the sub-domain {uj,(l>k\ is assumed to have relaxed towards 
its local power curve value PL{uj,^k), and its PDF is 


f{P\Uj,(l)k) 


cx,k{P-PL{u,Ak)y 


(A5) 


The two PDFs f{P\uj,^k) and fst{P) are presented in figure 25 for various wind speed bins. We observe a reasonably 
good agreement between the two PDFs, which can be improves The optimization is based on the fact that the PDFs 
are reasonably close to a Gaussian distribution 


f{P\up(t>k) 


1 



2 ap 2 y’ 


(A6) 


with P and ap the mean and standard deviation of (P|u,-, 0^). Observing the analogy between equations (A5) and 
(A6 ), we optimize the drift and diffusion coefficients following 


DW(P,u„<^fe) = a,fe-(P-P) (A7) 

(P^ Uj Ak) = ajk-crp^ . (A8) 


The stationary PDF calculated from the optimized coefficients is presented in figurej^ We observe a better agreement 
to the measured PDFs, validating the optimization procedure]^ To summarize, we relate the stationary solution of 
the Fokker-Planck equation to the PDF of the measured power data, in each wind speed/direction sub-domain. This 
way we can fine-tune the parametric form of the drift and diffusion coefficients to reproduce exactly the distribution 
of the measured data. This optimization is used for models 2 and 3 presented resp. in subsections |III B| and |III C| 
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